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A system for generating recombinant, human parainfluenza virus, particularly infectious, recombinant, human parainfluenza virus type 
3 (HPIV-3) is provided. In one embodiment, the system comprises a clone comprising a nucleotide sequence that encodes a full-length, 
positive sense, anti-genome of HPIV, and at least one support clone comprising a nucleotide sequence that encodes the HPIV P protein 
and the HPIV L protein. In another embodiment, the system further comprises a support clone which comprises a nucleotide sequence 
that encodes the HPIV NP protein. The present invention also provides a clone which comprises a nucleotide sequence encoding the 
full-length, positive sense, anti-genome of HPIV-3. The clone also comprises an RNA polymerase promoter operatively linked to the 
HPIV-3 antigenome-encoding sequence. In a preferred embodiement, the clone further comprises a nucleotide sequence which encodes a 
ribozyme immediately downstream from the sequence encoding the HPIV-3 anti-genome. The present invention also relates to a method 
of preparing recombinant HPIV-3 virus having site-specific mutations in the HPIV-3 genome. The method comprises preparing a clone 
comprising a modified HPIV-3 antigenome-encoding sequence; introducing the modified HPIV-3 clone and support clones which comprise 
nucleotide sequences encoding an HPIV-3 P protein, an HPIV-3 L protein, and, preferably, an HPIV-3 NP protein into host cells; and 
culturing the host cells under conditions that allow for synthesis of the modified HPIV-3 antigenome and the L, P, and NP proteins of 
HPIV-3. 
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An Infectious Clone for Human Parainfluenza Virus Type 3 

This work was supported in part by National Institutes of Health Grant No. 32027 from the 
Institute of Allergy and Infectious Diseases, The government may have rights in this invention. 

BACKGROUND 

Recognized in 1956 as a cause of respiratory infection in man, human parainfluenza viruses 
5 (HPIV) are believed to account for 4 to 22 percent of the respiratory illnesses in children, second only to 
respiratory syncytial virus in this regard. HPIV are important causes of the lower respiratory tract diseases 
such as pneumonia, and bronchiolitis, and are the most common cause of croup in young children. Of the 
four HPIV serotypes, 1-4, type 3 virus (HPIV-3), appears to be the most virulent, frequently causing 
bronchiolitis and pneumonia during the first month of life, 

10 Unfortunately, effective vaccines or antiviral therapies, which can be used to prevent or treat 

HPIV-induced infections, are not presently available. Standard methods which are used to produce inactive 
viruses, such as heat inactivation or chemical treatment of the virus, have been unsuccessful with all HPIV 
strains and serotypes, including HPIV-3. Moreover, standard methods for producing attenuated viruses 
produce mutations at random sites and do not allow one to modify the HPIV genome at specific sites or to 

1 5 control the number of mutations that are introduced into genome. 

Human parainfluenza viruses are enveloped, single -stranded, negative sense RNA viruses that are 
members of the paramyxovirus genus within the family Paramyxoviridae. Replication of the human 
parainfluenza viral genome (vRNA) is similar to that of other members of the Paramyxoviradae family. 
Upon infection of a cell, transcription is the major RNA synthetic event, resulting in the production of the 

20 viral mRNAs from the negative-sense genome, i.e., the vRNA. Later in infection a transition to RNA 
replication occurs, resulting in synthesis of a full-length, antigenomic, positive-sense RNA. which serves as 
the template for synthesis of additional negative-sense genomic RNA. Transcription and replication of the 
genomic RNA is dependent upon formation of a ribonucleoprotein complex (RNP) consisting of the 15462 
nucleotide genomic RNA encapsidated by the nucleocapsid protein (NP), and the closely associated 

25 phosphoprotein (P), and the large (L) polymerase protein. Several host cell factors are also involved in the 
replicative cycle of HPIV. 

The requirement for an intact RNP for HPIV has hindered analysis of HPIV transcription and 
replication in a cell-free system. Efforts to encapsidate HPIV-3 vRNA in vitro have failed, and unlike the 
positive sense RNA viruses, naked HPIV vRNA is not infectious. Moreover, there currently are no known 

30 systems for preparing recombinant HPIV, including recombinant infectious HPIV-3. 
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Accordingly, there is a need for new reagents, systems, and methods that enable one to produce a 
recombinant HPIV, particularly a recombinant, infectious HPIV-3. Recombinant systems that permit one 
to introduce one or more site-specific mutations into the genome of HPIV, particularly HPIV-3, are 
desirable. Recombinant systems which allow one to characterize the effect of site-specific mutations on 
5 the transcription or replication of human parainfluenza viral RNA and to identify the site specific mutations 
which lead to the production of attenuated HPIV are especially desirable. 

SUMMARY OF THE INVENTION. 
In accordance with the present invention a system for generating recombinant, human 
10 parainfluenza virus, particularly infectious, recombinant, human parainfluenza virus type 3 (HPIV-3) is 
provided. In one embodiment, the system comprises a clone comprising a nucleotide sequence that 
encodes a full-length, positive sense, anti-genome of HPIV, and at least one support clone comprising a 
nucleotide sequence that encodes the HPIV P protein and the HPIV L protein. In another embodiment, the 
system further comprises a support clone which comprises a nucleotide sequence that encodes the HPIV 
1 5 NP protein. Preferably, each of the clones in the system comprises an RNA polymerase promoter which is 
operatively linked to the respective HPIV nucleotide sequence contained within the clone. 

The present invention also provides a clone which comprises a nucleotide sequence encoding the 
full-length, positive sense, anti-genome of HPIV-3. The clone also comprises an RNA polymerase 
promoter operatively linked to the HPIV-3 antigenome-encoding sequence. In a preferred embodiment, 
20 the clone further comprises a nucleotide sequence which encodes a ribozyme immediately downstream 
from the sequence encoding the HPIV-3 anti-genome. 

The present invention also relates to a method of preparing recombinant HPIV-3 virus having site- 
specific mutations in the HPIV-3 genome. The method comprises preparing a clone comprising a modified 
HPIV-3 antigenome-encoding sequence; introducing the modified HPIV-3 clone and support clones which 
25 comprise nucleotide sequences encoding an HPIV-3 P protein, an HPIV-3 L protein, and, preferably, an 
HPIV-3 NP protein into host cells.; and culturing the host cells under conditions that allow for synthesis of 
the modified HPIV-3 antigenome and the L, P, and NP proteins of HPIV-3. 

The ability to produce recombinant, HPIV-3 virus genetically engineered to contain site-specific 
mutations within the HPIV-3 genes and cis-acting elements expedites the study of all aspects of the virus 
30 replication cycle. Additionally, a system which permits production of recombinant HPIV that is 
genetically engineered to contain site-specific mutations within the HPIV-3 genome is useful for 
identifying attenuating parainfluenza genotypes and for developing a live vaccine for human parainfluenza 
virus. 

3 5 BRIEF DESCRIPTION OF THE FIGURES 
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Figure 1 depicts the DNA form of the nucleotide sequence of the HPIV-3 genome and shows the 
location of restriction sites, the leader sequence, the trailer sequence, and the protein encoding regions of 
the genome. 

Figure 2 is a restriction map of the pOCUS-2™ vector. 
5 Figure 3 is a restriction map of pMG(+) showing the location of the leader sequence, luciferase 

encoding region, and theT7 promoter and terminator. 

Figure 4 is a restriction map of pHPIV-3 showing the location of the leader sequence and the 
protein encoding regions of HPIV-3 . 

Figure 5 is a schematic depiction of the full-length infectious clone, pHPIV-3. VV<|>, vaccinia 
1 0 virus polymerase stop signal (TTTTTNT); T7, T7 RNA polymerase promoter; le, HPIV-3 leader sequence; 
NP, P, M, F, HN and L are the HPIV-3 protein coding regions; tr, HPIV-3 trailer sequence; Rz, the 
hepatitis delta virus antigenomic ribozyme; T7<t>, T7 RNA polymerase terminator signal. Regions 
containing substitution mutations are expanded and shown above with the specific changes indicated. The 
A to G change at viral base 94 creates a Sad site and the A to G change at viral base 15389 creates a Stul 
15 site. 



DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the present invention a system for generating recombinant, human 
parainfluenza virus is provided. In a preferred embodiment, the system is used to generate recombinant 
20 HPIV-3, The system comprises a clone comprising a nucleotide sequence, preferably a double-stranded 
DNA sequence, which encodes a full-length, positive sense anti-genome of HPIV hereinafter referred to as 
the "HPIV clone", and one or more support clones which comprise nucleotide sequences that encode an 
HPIV P protein and an HPIV L protein. The nucleotide sequences that encode the HPIV P protein and 
HPIV L protein may be within the same clone. However, for ease of manipulation, it is preferred that the 
25 nucleotide sequences that encode the HPIV P protein and the HPIV L protein be on separate clones. 
Preferably the HPIV clone comprises a sequence encoding an HPIV-3 antigenome. Preferably the support 
clone or clones encode a P protein and an L protein of HPIV-3. In another embodiment the system further 
comprises a support clone which comprises a nucleotide sequence that encodes the HPIV NP protein, 
preferably the HPIV-3 NP protein. 
30 As used herein "clone" refers to double-stranded DNA that can be introduced into a cell and 

expressed. The clone may be in the form of a viral vector such as, for example, a vaccinia viral vector, or, 
preferably, in the form of a plasmid. Preferably, the HPIV clone and the support clones each comprise an 
RNA polymerase promoter, more preferably a T7 RNA polymerase promoter. Each of the RNA 
polymerase promoters is operatively linked to the corresponding HPIV encoding sequence in the clone. 
35 Thus, the RNA polymerase promoter on the HPIV clone is operatively linked to the HPIV sequence and 
the RNA polymerase on the support clones are operatively linked to the sequence or sequences encoding 
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the respective HPIV protein. Preferably, the plasmids comprising the clone also comprise an origin of 
replication, particularly a bacterial origin of replication. 

The present invention also provides a clone which comprises a nucleotide sequence encoding the 
anti-genomic sequence of HPIV-3, hereinafter referred to as the "HPIV-3 clone". Preferably, the HPIV-3 
5 clone encodes a full-length antigenomic sequence of HPIV-3. As used herein "full-length" means that the 
anti-genomic sequence is complementary to the entire negative sense, genomic sequence of HPIV-3 
extending from the 3' nucleotide of the leader sequence through the 5'nucleotide of the trailer sequence of 
the HPIV-3 genome. The DNA form of the full-length, genomic sequence of HPIV-3, SEQ ID NO:l, is 
shown in Fig. I. In addition to the leader and trailer sequences, the HPIV-3 clone contains sequences 
10 encoding the HPIV-3 proteins N, P, M, F, HN, and L, as well as the cis-acting elements. The HPIV-3 
clone may encode a wild-type HPIV-3 antigenome sequence or a modified HPIV-3 antigenome having one 
or more mutations contained therein. The mutation may be in the form of a foreign gene which is inserted 
into the HPIV-3 antigenome-encoding sequence. Preferably, the mutations are substitutions of one or more 
nucleotides, deletions of 6 to 12 nucleotides, or additions of 6 to 12 nucleotides in the HPIV-3 
1 5 antigenome-encoding sequence. More preferably, the modified HPIV-3 clone contains substitutions either 
in the genes or the cis-acting elements, or both of the HPIV-3 antigenome-encoding sequence. 

Preferably, the HPIV-3 clone is a plasmid that comprises a nucleotide sequence which encodes a 
ribozyme, more preferably the hepatitis delta virus antigenomic ribozyme, immediately downstream from 
the HPIV antigenome-encoding sequence. Following transcription of the clone, the ribozyme cleaves the 
20 ribozyme from the HPIV antigenome to provide a replication competent 3' end on the antigenome. More 
preferably, the HPIV-3 clone also comprises an RNA polymerase terminator following the ribozyme 
sequence. In one embodiment, the HPIV-3 clone is the plasmid pHPIV-3 depicted in Fig. 5, which 
plasmid was deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD 

20852, USA, on May , 1998 and has Accession Number . 

25 The present invention also relates to a method of preparing recombinant HPIV, particularly 

HPIV-3, using the above described system. The method comprises introducing an HPIV clone and the 
support clones which encode HPIV P protein and HPIV L protein, into host cells, preferably human cells; 
culturing the host cells under conditions that allow for formation of an HPIV anti-genomic transcript, 
synthesis of the HPIV genome (vRNA) and the HPIV proteins L, P, and NP, and formation of a 
30 recombinant HPIV; and recovering the recombinant HPIV from the culture. Preferably the host cells which 
are transfected with the HPIV clone and support clones, contain an RNA polymerase that corresponds to 
the RNA polymerase promoter that is operatively linked to the HPIV sequences in the HPIV clone and the 
support clones! In a preferred embodiment, a support clone comprising the nucleotide sequence which 
encodes the HP1V-NP protein operatively linked to an RNA polymerase promoter is also introduced into 
35 the cells. Preferably, the host cells are infected with a viral recombinant, preferably a vaccinia virus 
recombinant, which expresses the RNA polymerase, more preferably the T7 RNA polymerase, prior to or 
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in combination with transfection with the HPIV clone and support clone or clones. When such cells are 
infected with the vaccinia virus recombinant, it is preferred that the HPIV-3 clone also comprise a vaccinia 
virus RNA polymerase terminator upstream of the T7 RNA polymerase and a vaccinia virus RNA 
polymerase terminator downstream of the T7 RNA polymerase terminator. 
5 The present invention also relates to a method of introducing site-specific mutations into the 

genome of a recombinant HPIV-3. The method comprises preparing a a modified HPIV-3 clone 
comprising one or more mutations in the sequence which encodes the HPIV-3 anti-genome; introducing 
the modified HPIV-3 clone and support clones comprising sequences which encode HPIV-3 P protein and 
HPIV-3 L protein, and preferably, HPIV-3 NP protein into host cells; and culturing the host cells under 

10 conditions that allow for formation of a modified HPIV-3 antigenomic transcript and synthesis of the 
HPIV-3 L, P, and NP proteins. The modified HPIV-3 clone and the support clones contain an RNA 
polymerase promoter that is operativeiy linked to the HPIV-3 protein-encoding sequences. The host cells 
contain within the cytoplasm thereof an RNA polymerase that corresponds to the RNA polymerase 
promoter on the modified HPIV-3 clone and the support clones. 

1 5 Preferably, the modified HPIV-3 clone, containing one or more mutations therein, is made by 

conventional PCR techniques using an HPIV-3 clone as a template. The mutations are made in the cis- 
acting elements of the HPIV-3 sequence or in an HPIV-3 protein encoding sequence. Preferably the 
mutation is made in the L protein-encoding sequence. If mutations are made in the HPIV-3 protein- 
encoding sequences of the HPIV-3 clone, it is preferred that a similar type of mutation be made in the same 

20 site in the protein encoding sequence of the corresponding support clone. For example, if a mutation is 
made at a specific site in the L protein-encoding sequence of the HPIV-3 clone, it is preferred that the same 
mutation be made at the same site in the L protein-encoding sequence of the L protein-encoding support 
clone. Such method is useful for identifying mutations that block the synthesis of viral particles or result in 
the production of non-infectious or non-virulent HPIV-3. 

25 To determine whether the mutated viruses produced by the above-described method are non- 

virulent, i.e., attenuated, the mutated viruses are first tested in vitro to determine whether the mutation has 
resulted in a slower growing phenotype, i.e., the mutated virus grows more slowly in tissue culture than the 
wild-type virus. The mutated viruses which exhibit this phenotype are then examined in vivo, by injection 
into an animal, such as the cotton rat, which is good experimental model for parainfluenza virus. The 

30 infected animals are then examined to determine if they are producing antibodies to HPIV-3 and to 
determine if there is a reduction in the severity of symptoms as compared to animals infected with wild- 
type virus. 

The ability to produce= recombinant HPIV-3 virus genetically engineered to contain specific 
alterations within the HPIV-3 genes and cis-acting elements expedites the study of all aspects of the virus 
35 replication cycle. Additionally, a system which permits production of recombinant HPIV that is 
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genetically engineered to contain specific alterations within the HPIV-3 genes is useful for identifying 
attenuating parainfluenza genotypes and for developing a live vaccine for human parainfluenza virus. 

The following examples of methods of preparing a full-length cDNA clone of HPIV-3 and 
methods of preparing a modified or mutated, infectious, recombinant HPIV -3 are for purposes of 
5 illustration and are not intended to limit the scope of the invention. 
Example 1 . Construction of a Full-Length cDNA Clone of HPIV-3. 

The construction of a full-length infectious clone of HPIV-3 containing mutations at specific sites 
was achieved by a two-step process. The initial step was the generation of a minireplicon which contained 
the positive sense leader portion region and trailer regions of HPIV-3. The second step involved the 
10 insertion of RT-PCR fragments derived from HPIV-3 genomic RNA into the minireplicon.The positive- 
sense minireplicon contained the following: A T7 promoter which directed the synthesis of two non-viral G 
residues, followed by the positive-sense leader region of HPIV-3, a portion of the NP 5' UTR (to viral base 
97), the Iuciferase gene, a portion of the L 3'UTR (starting at viral base 15387) and trailer sequences of 
HPIV-3. The full-length nucleotide sequence of HPIV-3 genome and the location of the leader sequence, 
15 trailer sequence and protein-encoding regions is shown in Figure 1. The hepatitis delta virus antigenomic 
ribozyme followed to effect precise cleavage after the 3 terminal HPIV-3 specific base. A T7 RNA 
polymerase terminator was also incorporated into the replicon followed the ribozyme sequence. 
Additionally, vaccinia virus polymerase termination signals were inserted immediately upstream and 
downstream of the aforementioned sequences. During the construction, single base changes were created 
20 in the regions encoding the NP 5' UTR and the L 3 UTR. An A to G change at viral base 94 and the A to 
G change at base 15389 created Sad and StuI sites, respectively, which served as genetic tags to identify 
virus as being of recombinant origin. 

The vector pOCUS-2 (Novagen) was chosen as the starting plasmid for preparing the mini- 
replicon [pPIV3-MG(+)], because of its small size (1930 bp). It is believed that the use of a small starting 
25 plasmid may increase the stability of the full-length clone. 

The mini-replicon was constructed by generating PCR products encoding the leader and trailer 
regions flanked by a T7 promoter and hepatitis delta virus antigenomic ribozyme, respectively. The 
primers used for synthesis of the T7 promoter/leader region were: 5'- 
TAGTCGGCCC TAATACGACTCACTATAGG ACCAAACAAGAGAAGAA 
30 ACT-3\ SEQ ID NO:2, and 5 ' -G A A ATTATAG AGCTCCCTTTTCT-3 ' , SEQ ID NO:3. The first primer 
encodes an EagI site and the T7 promoter (underlined) and the second primer introduced an A to G base 
change at viral base 94, (bold) within the 5' untranslated region (UTR) of the NP mRNA, which creates a 
Sad site: The template for this reaction pHPIV3-CAT, described in De, B. P. and A. K. Banerjee. (1993.) 
Rescue of synthetic analogs of genome RNA of human parainfluenza virus type 3. Vir. 196:344-348, 
35 which is incorporated herein by reference. The resulting PCR product was cloned into the EagI and Sad 
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sites of pOCUS-2, which is depicted in Figure 2. The primers used for synthesis of the trailer/ribozyme 
region were: 

S'-TAAGGCCTAAAGATAGACAAAAAGTAAGAAAAACATGTAATATATATA 
TACCAAACAGAGTTCTTCTCTTGTTTGGTGGGTCGGCATGGCATCTC-3 > , 

5 SEQ ID NO:4, and 5 ' -CTGGGTAC CTCCCTTAGCCATCCG AGT -3 ' , SEQ ID NO:5. The first primer 
contains sequence from the 3' UTR of the L mRNA, through the trailer, and primes synthesis of the 
ribozyme (underlined). Also, an A to G change at viral base 15389 (bold), which creates a Stul site within 
the 3'UTR of the L mRNA is encoded by this primer. The second primer encodes the 3' end of the 
ribozyme (underlined) and a Bglll site. The template for this PCR reaction was pSAl, a plasmid 

10 containing the ribozyme sequence, as previously described in Perrotta, A. T. and M. D. Been. 1991. The 
pseudoknot-like structure required for efficient self-cleavage of hepatitis delta virus RNA. "Nature 
350:434-436, which is incorporated herein by reference. The PCR product derived from this reaction was 
cloned into the Stul and Bglll sites of pOCUS-2. The leader and trailer regions were combined into a 
single clone by transferring the Eagl/PstI fragment of the T7/leader clone into the PacI/PstI sites of the 

1 5 trailer/ribozyme clone. 

To prevent possible interference by transcription from cryptic vaccinia virus promoters, vaccinia 
virus polymerase transcription stop signals (TTTTTNT) were inserted upstream and downstream of the 
replicon near PvuII and Sspl sites within pOCUS-2. A T7 transcription termination signal was removed 
from pET-17b by digesting with BIpl and BspEI, and inserted into the Sspl site (blunted with T4 DNA 

20 polymerase) of pOCUS-2. A luciferase reporter gene was then inserted into the Sacl and Stul sites to 
create pPIV3-MG(+), which is schematically depicted in Figure 3. 

To generate the full-length HPIV-3 clone, five RT-PCR products were generated from HPIV-3 
virion RNA and cloned. These fragments were subsequently inserted into P P1V3-MG(+), replacing the 
luciferase coding sequences, to create pHPIV-3, the full-length clone. The five RT-PCR products were 
25 generated from HPIV-3 strain 47885 virion RNA which was obtained from Robert Chanock, at the 
National Institues of Health. These PCR products, encompassing the remainder of the HPIV-3 genome, 
were identified by restriction enzyme analysis and cloned, either in pUCI9 or pOCUS-2, and then inserted 
into pPIV3-MG(+). 

The first PCR product containing viral bases 83 to 2721 was inserted into the Smal site of pUC19. 

30 The 83/2721 clone was then digested with Sacl and XmnI, removing viral bases 94 to 553 which were 
inserted into the Sacl and SphI (blunt with T4 DNA polymerase) of pPIV3-MG(+). The 83/2721 clone 
was then digested with PstI to remove a fragment containing viral bases 540 to 2274, which was then 
inserted into the PstI site of the pPIV3-MG(+) clone containing the 94/554 fragment. The second PCR 
product encompassing viral bases 13395 to 15397 was cloned into the Smal site of pUC19. This 

35 13395/15397 clone was then digested with Stul and Pad and the resulting fragment containing viral bases 
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13632 to 15381 was inserted into the StuI and Pad sites of the pPIV3-MG(+) clone containing viral 
sequence to base 2274. The resulting clone contained viral bases 1 to 2274 and 13632 to 15462 in the 
pPIV3-MG(+) context. 

The third PCR product containing viral bases 7403 to 11513 was digested with BspMI (blunted 
with T4 DNA polymerase) and Xhol to produce a fragment containing bases 7437 to 1 1444 which was 
inserted into the Xhol and Sspl sites of pOCUS-2. The fourth PCR fragment containing viral bases 10904 
to 13773 was digested with PvuII and BamHI (viral bases 10918 to 13733) and inserted into the EcoRI 
(blunted with T4 DNA polymerase) and BamHI sites of pUC19. The two viral segments were combined 
by digesting the 7437/1 1444 clone with Sad (blunted with T4 DNA polymerase) and EcoNI and inserting 
into the 10918/13733 clone digested with EcoRI (blunted with T4 DNA polymerase) and EcoNI. The 
resulting clone contained viral bases 7437 to 13733 in a pUC19 background. The remainder of the viral 
sequence was derived from a fifth PCR product encompassing viral bases 83 to 7457 which had been 
digested with Xmnl and Xhol (viral bases 553 to 7437) and cloned into the StuI and Xhol sites of pOCUS- 
2. The 7437/13733 clone was then digested with BamHI, blunted with T4 DNA polymerase, and digested 
with Xhol to release a fragment that was inserted into the Eagl (blunted with T4 DNA polymerase) and 
Xhol digested 553/7437 clone. The resulting clone contained viral bases 553 to 13733. This clone was 
then digested with PshAI and Pad and the resulting fragment containing viral bases 2143 to 13632 was 
inserted into the same sites of the pPIV3-MG(+) clone containing viral bases 1 to 2274 and 13632 to 
15462. This generated pHPIV-3, the infectious clone, which is schematically depicted in Figure 4. 

To insert the P gene into pGEM-4, P sequences were transferred from a P-lac-fusion clone, 
(described in 38, which is incorporated herein by reference) by digesting with Xbal (blunted with T4 DNA 
polymerase) BamHI, and inserted into the Kpnl (blunted with T4 DNA polymerase) and BamHI sites of 
pGEM4. The pPIV3-NP and pPIV3-L clones, as described in 15, 16, which are incorporated herein by 
reference., were in a pGEM-4 background. The pPIV3-L clone was also modified. In the natural L mRNA 
sequence there is a non-initiating AUG 1 1 nucleotides from the 5' end of the transcript. This was removed 
from pPIV3-L by mutational PCR, changing viral bases 8636 and 8637 from AT to TA. 

Example 2. Preparation of Recombinant HPIV-3. 

Confluent monolayers of HeLa cells in 6-well plates were infected with recombinant vaccinia 
virus vTF7-3 at a multiplicity of infection of 2. vTF7-3 expresses T7 RNA polymerase as described in. 
Fuerst, T. R., P. L. Earl, and B. Moss. 1987. "Use of a hybrid vaccinia virus-T7 RNA polymerase system 
for expression of target genes." Mol. Cell. Biol. 7:2538-2544, and Fuerst, T. R., E. G. Niles, F. W. 
Studier, and B. Moss. 1986. "Eukaryotic transient-expression system based on recombinant vaccinia virus 
that synthesizes bacteriophage T7 RNA polymerase." Proc. Natl. Acad. Sci. 83:8122-8126, which are 
incorporated herein by reference. After 1 hour at 37°C, pPIV3-NP, pPIV3-P> pPIV3-L and pHPIV-3 were 
transfected using Lipofectin (BRL) according to manufacturers instructions. After three hours the 
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transfection medium was removed and replaced with 1.5 ml Dulbecco's modified Eagle's medium 
(DMEM)/5% fetal bovine serum. After 40 to 48 hours the plates were frozen, thawed and scraped. The 
clarified medium supernatant (250 u.1) was then used to infect fresh HeLa cell monolayers in 6-well plates. 
DMEM (1.5ml) containing 25 ug/ml l-B-D-arabinofuranosylcytosine (araC) to inhibit vaccinia virus 
5 replication was added after a 1 hour attachment. After forty hours the plates were frozen, thawed, and 
scraped. The clarified medium supernatant was then titered for HPIV-3 in the presence of AraC. During 
the titering, isolated plaques were picked as agar plugs. The agar plugs were placed in 500 ml opti-MEM 
at 4°C for 4 hr. 250 ul were then used to infect fresh HeLa cell monolayers for amplification of the plaque 
isolates for 40 hr. 

10 In a preferred embodiment transfection conditions were 1 \ig pHPIV-3, 2 ug pPIV3-NP, 4 ug 

pPIV3-P and 0.1 ug pPIV3-L. Under these conditions approximately 1000 pfu per 6 X 10 s cells were 
obtained during the initial transfection and 10 6 pfu per 6 x 10 s cells after the amplification. 

When individual plasmids were omitted from the transfection step it was observed that the 
pHPIV-3, pPIV3-P and pPIV3-L plasmids were required for recovery of virus but, surprisingly, as shown 
1 5 in Table I below, pPIV3-NP was not. 



20 



Table 1 



Plasmids Transfected 
HPIV-3 
+ 

+ 
+ 
+ 



NP 

+ 

+ 
+ 



P 

+ 
+ 



L 
+ 
+ 
+ 
+ 



Virus Recovery 
14/15* 
0/2 
3/3 b 
0/3 
0/2 



25 



Table 1. Recovery of HPIV-3 from pHPIV-3. HeLa cell monolayers were infected with vTF7-3 and 
transfected with the indicated plasmids. After 40 hr cells were lysed and supematants added to fresh HeLa 
cell monolayers in the presence of araC to inhibit vTF7-3 replication. These monolayers were then lysed 
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and the supernatants assayed for HPIV-3. The number of experiments for which HPIV-3 was recovered 
per attempts is displayed under virus recovery. 

A The single experiment that did not yield HPIV-3 used 0.1 ug of the P plasmid. 
B The omission of the NP plasmid resulted in 3 to 5 fold lower titers of HPIV-3. 



Characterization of Recovered Virus. 

In order to characterize the recovered virus and to purify HPIV-3 from vTF7-3, plaques of which 
are only slightly smaller than those of HPIV-3, isolated plaques suspected to be HPIV-3 were picked and 
amplified in HeLa cells. The plaque purified and amplified virus isolates and appropriate controls were 
then analyzed in neutralization assays. Virus (isolates #3 and #5) was preincubated (30 min on ice) with 5 
ul rabbit pre-immune serum, 5 ul rabbit polyclonal anti-HPIV-3 antisera, or assayed in the presence of 
25ug/ml araC The sera was incubated with virus on ice for 30 min prior to a standard plaque assay. To 
allow maximal plaque development, the plates were then incubated at 37°C for 66 hr prior to staining with 
crystal violet. The results of the assay indicated that the plaque purified virus was completely inhibited by 
the anti-HPIV-3 antisera, while vTF7-3 was not. In contrast, the HPIV-3 isolates were not inhibited by 
AraC, whereas the vTF7-3 virus was completely inhibited. Interestingly, of the eight recombinant HPIV-3 
isolates, four had plaque sizes identical to the parental HPIV-3 stock while four were slightly larger. The 
plaque size of isolate #3 was slightly larger than isolate #5 and the wild type HPIV-3 virus. 

To determine whether the NP coding sequence of pHPIV-3, which shares the same position as 
luciferase in pPIV3-MG(+), was being expressed from pHPIV-3, pHPIV-3 and pPIV3-NP were separately 
transfected into vTF7-3 infected HeLa cells and cell lists prepared after 48 hr. The lysates were then 
analyzed by Western blotting using an anit-HPIV-3RNP antisera. This antisera recognizes primarily NP 
and reacts poorly with P. 

Specifically, extracts (equivalent to 6xl0 4 cells) were run on SDS-10% PAGE and transferred to 
nitrocellulose membranes. The primary antibody was a rabbit polyclonal and- RNP antisera diluted 1 : 1000. 
The secondary antibody was 1:1000 dilution of a goat anti-rabbit antibody conjugated to horseradish 
peroxidase. Visualization was through chemiluminescence (ECL kit, Amersham). As shown by the 
Western blot, HPIV-3 RNP recognized NP from the pPIV3-NP and pHPIV-3 transfected cell extracts and 
from purified HPIV-3 RNP. No proteins were recognized in a mock-transfected HeLa extract. Thus, it 
appears that NP is expressed from pHPIV-3, presumably being translated from the T7-directed, 
antigenomic RNA transcript. 

To determine whether the recovered, recombinant virus had specific mutations in its genome, 
RNA was extracted from wild type and plaque isolated viruses and used for RT-PCR analysis using 
primers flanking the substitution mutations. Viral RNA was isolated from approximately 2xl0 7 plaque 
forming units (pfu) of plaque purified virus isolates #3, 5, 7 and 9, or wt HPIV-3 strain 47885/. Reverse 
transcription was carried out using Superscript II reverse transcriptase (BMB) at 44°C for I hr using 
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oligonucleotides which primed at viral base 23 or 15100. The PCR was carried out with Expand Long 
polymerase (BMB) using second primers which result in amplification of viral bases 23 to 303, or 15100 to 
15440. 

PCR products encompassing viral bases 1 to 324 and 15080 to 15462 were generated from the 
5 indicated isolates, digested with Sacl and StuI, respectively, and analyzed on a 1.4% agarose gel.. PCR 
products of the expected sizes were generated in a RT-dependent manner, indicating that the PCR products 
were derived from RNA rather than contaminating plasmid DNA. 

As shown on the agarose gel, the sizes of the 1 to 324 and 15080 to 15462 PCR products are 
increased by 21 and 22 base pairs, respectively, over the length of the viral specific regions due to the 
1 0 inclusion of restriction enzyme sites in the amplification primers. Digestion with Sacl showed that the 
mutation at base 94 was not present in the wild-type virus but was present in the plaque isolated viruses, 
indicating they are of recombinant origin. Similarly, PCR product derived from the region encompassing 
viral base 15389 of wild type HP1V-3 was not cleaved by StuI. However, only four of the eight plaque 
isolated viruses contained the mutation which creates the StuI site. Direct sequencing of the PCR products 
1 5 confirmed these results. 

DISCUSSION 

A full-length plasmid clone of the HPIV-3 genome, pHPIV-3 was constructed.. Upon transfection 
of pHPIV-3 and plasmids encoding the viral NP, P and L proteins into vTF7-3-infected HeLa cells, 
recombinant HPIV-3 bearing genetic markers was efficiently recovered. Several interesting features of this 
20 system were noted. First, the viral NP protein could be expressed from the infectious clone, and this 
expression obviated the need for an NP support plasmid. It is believed that the NP protein is synthesized 
directly from the primary anitgenomic transcript after it is capped by the vaccinia virus capping enzyme. 

Second, two recombinant viruses with distinct genotypes and phenotypes were produced, 
probably due to recombination between the pHPIV-3 and pPIV3-L plasmids, although the possibility that 
25 the reversion arises during RNA replication cannot be excluded. pPIV3-L contains the entire L 3' UTR and 
part of the trailer region, extending to base 15437, an overlap of 48 base pairs beyond the StuI site. This is 
ample room for recombination between plasmids to occur readily in vaccinia virus infected cells 

From these results, there appears to be selection in the HPIV-3 system. AH the large plaque virus 
isolates had reverted to a wild-type sequence at base 15389, while retaining the change at base 94. Since 
30 A94G is the only known alteration in these viruses from the parental virus, it appears that The isolates 
which retained both mutations had a plaque size identical to that of the parental (wild type) virus, but when 
the 15389 mutations was lost, plaque size increased, indicating that the mutation at base 15389 was 
detrimental in the context of the A94G mutation. There was one other known change between pHPIV-3 
and the support plasmids. A non-initiating AUG exists in the natural L protein message. Since this AUG 
35 is only 1 1 nucleotides from the 5' end of the L mRNA and in a poor translation initiation context, it may 
not cause much interference with L mRNA translation. However, in the support plasmid pPIV3-L this 
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non-initiating AUG is much further from the 5 1 end of the transcript where it is more likely to be 
recognized by ribosomes. This AUG was removed from pPIV3-L by changing bases 8636 and 8637 from 
AT to TA, destroying a SphI site and creating a Nhel site. To investigate whether this change was present 
in the recombinant virus and could be responsible for the large plaque phenotype, RT-PCR analysis was 
5 done. PCR products encompassing this site and derived from both the wild type and the plaque isolated 
viruses retained a wild type sequence, indicating recombination had not occurred over this region and that 
these changes could not account for any variance in plaque size. 

The finding that recombination may occur between transfected plasmids indicates that care must 
be taken when introducing mutations into the paramyxovirus or rhabdovirus infectious clone systems. 

10 Mutations introduced within the NP, P or L sequences preferably are carried by both the support plasmids 
and the infectious clone. Otherwise, resultant virus may not carry the desired mutation. The only possible 
exception to this is the HPIV-3 system, in which the HPIV-3 infectious clone expresses NP, negating the 
need for the NP support plasmid. Still, it is preferred that the HPIV-3 NP support plasmid be included in 
the system, since significantly greater yields of HPIV-3 were obtained when the support NP plasmid was 

1 5 included in the transfection. 

An infectious clone for HPIV-3 is useful for understanding the molecular biology of HPIV-3 and 
for developing a vaccine for this important pathogen. The ability to generate specific mutations within 
HPIV-3 makes all aspects of HPIV-3 replication amenable to study. Any mutation, including those studied 
previously in other contexts, can now be examined with this system. The ability to introduce specific 

20 mutations also permits the possibility of revertant analysis, which could refine our understanding of 
protein-protein or protein-RNA interactions. 

The infectious clone is also useful for identifying mutations which attenuate the virus. Such virus 
is useful for developing new vaccine strains of HPIV-3. In addition, mutations present in a current 
candidate vaccine strain of HPIV-3 can be inserted into pHPIV-3. Through identifying multiple deleterious 

25 mutations, it should be possible to engineer several mutations affecting various steps in the virus life cycle 
into a single HPIV-3 strain. Such a virus should be highly attenuated and not readily able to revert. 

While the invention has been described to some degree of particularity, various adaptations and 
modifications can be made without departing from the scope of the invention as defined in the appended 
claims 
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CLAIMS 

What is claimed is: 

1 . A recombinant HPI V clone comprising: 

a. a nucleotide sequence encoding a positive sense, antigenome of human 
parainfluenza virus; and 

b. an RNA polymerase promoter operatively linked to said nucleotide sequence. 

2. The clone of claim 1 wherein the nucleotide sequence encodes the antigenome of HPIV-3. 

3. The clone of claim 2 wherein said clone further comprises a ribozyme sequence downstream from said 
antigenome-encoding sequence. 

4. The clone of claim 2 wherein the RNA polymerase promoter is the 17 RNA polymerase promoter. 

5. The clone of claim 3 wherein the ribozyme is an antigenomic ribozyme. 

6. The clone of claim 3 further comprising an RNA polymerase terminator downstream of the ribozyme 
sequence. 

7. The clone of claim 2 wherein the nucleotide sequence encodes a modified anti-genome of HPIV-3. 

8. The clone of claim 2 wherein the clone has the characteristics of a plasmid deposited with the 
American Type Culture Collection and having Accession Number . 

9. The clone of claim 7 wherein the HPIV-3 antigenome-encoding sequence comprises a mutation selected 
from the group consisting of a substitution of one or more nucleotides, a deletion of from 3 to 12 
nucleotides, and an addition of from 3 to 12 nucleotides. 

10. A method for preparing recombinant human parainfluenza virus comprising; 

a. providing a recombinant system which comprises: 

i. an HPIV clone comprising a nucleotide sequence encoding a positive 
sense, antigenome of human parainfluenza virus; 
ii a support clone comprising a nucleotide sequence encoding a human 
parainfluenza virus L protein; and 

iii. a support clone comprising a nucleotide sequence encoding a human 
parainfluenza virus P protein, 
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wherein the nucleotide sequence encoding the P protein and L 
protein may be on the same support clone or on separate support 

clones; 

b. introducing the recombinant system of step (a) into host cells; 

c. culturing the host cells of step (b) for a time sufficient to permit 
transfection of the host cells and formation 

of recombinant human parainfluenza virus; and 

d. recovering the recombinant human parainfluenza virus from the culture of 
transfected host cells. 

1 1 . The method of claim 1 1 wherein the antigenome-encoding sequence of the HPIV clone is 
operatively-linked to an RNA polymerase promoter; 

wherein the P protein-encoding sequences of the support clones is operatively-linked to an RNA 
polymerase promoter ; 

wherein the L protein-encoding sequences of the support clones is operatively-linked to an RNA 
polymerase promoter; and 

wherein the host cells comprise an RNA polymerase corresponding to 
the RNA polymerase promoter of said HPIV clone and said support clones. 

12. The method of claim 11, wherein the host cells are infected with a viral recombinant capable of 
expressing the RNA polymerase prior to or in combination with introduction of the recombinant system 
into the host cells. 

13. A host cell for producing a recombinant human parainfluenza virus, said host cell comprising: 

a. an HPIV clone comprising a nucleotide sequence encoding a positive 
sense, antigenome of human parainfluenza virus; 
b a support clone comprising a nucleotide sequence encoding a human 
parainfluenza virus L protein; and 

c. a support clone comprising a nucleotide sequence encoding a human 
parainfluenza virus P protein; 

wherein the nucleotide sequence encoding the P protein and L 
protein may be on the same support clone or on separate support 

clones, 

14. The host cell of claim 13 further comprising a support clone comprising a nucleotide sequence 
encoding a human parainfluenza virus NP protein. 
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15. The host cell of claim 13 wherein the HP1V clone comprises a nucleotide sequence encoding an anti- 
genome of HPIV-3. 

16. The host cell of claim 15 wherein the anti-genome encoding sequence of HPIV-3 comprises a site- 
specific mutation. 

17. The host cell of claim 14 wherein the HPIV-3 clone further comprises an RNA polymerase promoter 
operatively linked to the HPIV-3 antigenomic sequence and wherein each of the support clones comprises 
an RNA polymerase promoter operatively linked to the HPIV-3 protein-encoding sequence of said support 
clone. 

1 8. A method of introducing a site-specific mutation into the genome of a recombinant human 
parainfluenza virus, comprising the following steps: 

a. preparing a clone comprising a nucleotide sequence encoding a human 
parainfluenza viral antigenome having a mutation at a specifc site; 

b. co-transfecting host cells with the clone of step (a), 

a support clone comprising a nucleotide sequence encoding an HPIV L 
protein, and a support clone comprising a nucleotide sequence encoding an HPIV P 

protein; 

wherein the nucleotide sequence encoding the P protein and L 
protein may be on the same support clone or on separate support 
clones; and 

c. culturing the transfected host cells for a time sufficient to allow formation of a 
recombinant human parainfluenza virus. 

19. The method of claim 18 further comprising the step of transfecting the host cells with a support clone 
comprising a nucleotide sequence encoding an HPIV NP protein. 

20. The method of claim 18 wherein the clone of step (a) is prepared using polymerase chase reaction 

techniques and a clone comprising a nucleotide sequence encoding a human parainfluenza virus 
antigenome as a template, wherein the antigenome-encoding nucleotide sequence of said template clone 

lacks the site-specific mutation.. 
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gcQQCtcQQtcQQtQgQgagttc^t^QQQgcgQtgctQQQQQCtQtcQQQtcQtggQttcttgggQQgQggQaccQQgQ 1840 

cgttgQgttQgttQtctctcQQCtQcctttcgctQcgQtttttgQtQgtttQgtQcctQQgaacccttctccttggttct 

• ■ • | • « • • | • 

1793 1826 

1826 
HboII 

BsoRI EcoRI Bbal 

I I I 

gQtQQQtCQQCtQQtQtCtCCtCggCCCtCQQCQtCQttgQQttCQtQCtCQgcaCCgQCCCCClQgQQgQCCtQtCggQ 1920 

ctQt t togt tgQt tQtagaggQgccgggQgt t gtQgtQQct tQQgtQtgagtcgtggctgggggt tct tctggatagcct 

• I i • j • ♦ j • • 

1858 1880 1907 

1907 

Haell Hindll 
I I 

QQQCgQCQCQQtCQQCQCQQgQQCCCQgCQQCtCQgcgCCaCCQtCtgtCQQCCOgQQQtCQQQCCQQCQgQQQCQQgtg 2000 

1 1 t gctgtgt tQgt tgtgt tct tgggtcgt tgagtcgcggtggtQgacQgttggtctttQgt ttggttgtctttgttcac 

• ■ «|*|« « • * 

1955 1968 

Ms 1 1 Bspl407I Sspl 
II I 

QQQQQgttQgtggQtCQQCtgQCQaQQQtQgQCQgtCtgggtCQtCQCQCgQQtgtQCQQCQgQQgCQQQQgQtQgQQQt 2080 

tttttcQQtcacctagttgactgttt t tatctgtcagacccagtQgtgtgcttQCQtgt tgtct tcgttttctatctt ta 
■ • • • i • I * * I* 
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Gsul 
EcoBII 

Hbvl PshAI BsnAI 

Bell 8spl407I HboII BsmAI BsAI 

I I II I I II I 

QttgQtcQggQQQCtgtQCQgggQggQtctgggQgQQgQQgcQgctcQgatQgtQgQgctgQgactgtggtctctggQgg 2160 
tQQctQgtcctttgacQtgtccctcctQgQccctcttcttcgtcgQgtctQtcQtctcgactctgQCQCCQgagacctcc 
I | * I *l * II II i ' 

2083 2095 2115 2141 2149 

2121 2143 2150 

2149 2.54 

SfaNI Sspl 

I I 

QQtctctggQQgcQtcQCQgQttctQQQQQtggaQCCCQQQQCQcggagaQtQttgQtctcQQtgQQattQgQQQgQtgg 2240 

t tagagaccttcgtagrgtctQagattt ttacct tgggttttgtgcctct tataactagagtractt taQtctttctacc 

• | • • « | • • • 

2173 2210 
Plal PstI EcoRV 

I I I 

QtaaggQctctQttgQgQggQQQQtgcgQcaQtctgcagQtgt tccQQgcgQgQtQtCQggaagtgQtgtcatatttQCQ 2320 
tQttcctgQgQtQQctctccttttQcgctgttagQcgtctQCQQggttcgctctQtQgtccttcactacQgtatQQQtgt 
| • ♦ | * I * * * 

2246 2274 2293 

Hindi 1 1 
Bell HboII 
I'll 

QCQgQQCQQQgtQgQQQCQgtgQtcQtggQQgQQgcttggQQcctQtcQgtQCQCCtgQtQCQQgttcQQtgagtgttgt 2400 

tgtcttgtttcQtctttgtcQctQgtQccttcttcgQQCCttggQtQgtcQtgtggactQtgttctagttQctcQCQQCQ 
« • | | * | • * • • • 

2341 2349 

2353 

HboII HboII 
Bbvl HboII Fokl HboII HboII 

I II III I 

tactgctgcgacQCCQgQtgQtgaagaQgQQatQctaatgaaaaatQgtQggatgaQgQQaQgttcttcaQcacQCCQQg 2480 
QtgQcgQcgctgtggtctactQcttcttctttQtgQttQctttttQtcQtcctacttcttttcQQgaagttgtgtggttc 
| . -II- ' 'II* I ' I 

2405 2423 2451 2465 2480 

2426 BsrI 2455 

QQgQtgQCQQQQgQQttQQQQQagggggggggggQQOQgggQQQgactggtttQQgQQQteaQgQgQtactgacQQCCQg 2560 
ttctQctgttttcttQQttttttcccccccccccttttccctttctgaccaaattctttagttctctatgactgttggtc 

• • • ' i | ■ • • • 

2526 

OCQtcaQCQtCQgOtCQCQQQCCCQCatCQaQQgggCQQQQgQQaQtCtCQQQQQCQQCQaCCQCCQQCaCCgQCQCQQQ 2640 

tgtagt tgtagtctagtgt ttgggtgtQgttttcccgttttcttttagagtttttgttgttggcggt tgtggctgtgttt 
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ggggCQQQCQgQQQCQCQgQCQgQQtCQtCQgQQQCQCQQtCCCCQtCQtggQQtCCCQttQtCgQCaQCQQCQCtgQCC 2720 

ccccgtttgtct t tgtgtctgtcttagtagtct ttgtgttaggggtagtacct tagggtaatagctgttgttgttactgg 

gQQCcgaacagacaQgcacaacccccccaQcaacaac t cccoga t caac t c g t QCQaaagaa t caa t ccgaacaQac t c t 2800 
cttggct tgtctgt tcgtgttgggggggt tgt tgt tgagggtctagt tgagcatgt tt tct tagt taggcttgtttgaga 

Hboll 

Mfal Ksp632I Clol 

I I I 

gQQtccQQQCccQQgacQCQQQagQCQattggQQQggQQQggaQggQtQcagaQgQgQgcQQtcgatttacQgQgQgggc 2880 
cttaggtttgggt tctgtgttt tctgttQQCctttcctt tccttcctQtgtcttctctcgt tagctQQQtgtctctcccg 

2826 2852 2862 

2852 
BsqBI 
I 

QQttQCtCtQttgCQgQQtCttggtgtQQttCQQtCtQCQCtQQQQCtQgQtttQtQtCQQgQCQQQCgQgttgtQtgtg 2960 

t tQQt gagataacgtc t tagaaccacQt tQQgt tagatgtagt 1 1 tgatctGaatatagt tct gt 1 1 gctcaacatQCQC 

• • • * j • • • 

2930 

SfoNI 

I 

tagcQQtitgtQctQQQCQQtgtagQtQctgcQtcaQagatQgQct tcctagcaggQttQggcatQggggtttcQQtggac 3040 

QtcgtttQcatgQtttgttQCQtctQtgQcgtQgtttctQtctgaaggatcgtcctQQtcQgtQtccccQQQgttQcctg 

• #!«•••• 

2990 

Vspl 

AsqI Bglll MboII 

I I I 

QQtgQCQCQaaQttQQtQCQgQtQCQQQQtgQQQtgttQaaCCtCQQQgCQgQtCtQQQgQgQQtggQCgQQtCQCQtQg 3120 

ttQctgtgttttQQt tatgtctatgt tttQct ttQCQQtttggQgtttcgtctagatttctcttQcctgcttagtgtQtc 

• | • • ♦ • | • • | 

3052 3091 3120 

3052 Bell BsqRI 

I ' I 

aagat tgatagaaaQt caaagagaocaac tgtcat tgatcacatcgt tQQt t tcQQQtct taaaat tatgactgagagag 3200 
t tctQQCtatct tt tagtt tctct tgt tgacagtaactQgtgtQgcaQt tQaagtt tagaat t ttaatactgactctctc 

• • • j • • ♦ • |* 

3156 3198 

Bell MboII 
I I 

gQggQQQgaQQgQccQQQQt gQQ t ccqq t gogagagta tctQtgat caagacQQiiat t gaaagQGgaQaaga t cQagQQQ 3280 
ctcctttctttctggttttQcttQggttQctctctcQtQgQtQctagttctgttttQQCtttcttcttttctQgttcttt 

• • • • | • • | • • 

3243 3263 
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Bell MboII 

I I 

gQggQQQgQQQgQCCaQQQtgQQtCCQQtgagQgQgtQtCtQtgQtCQQgQCQQQQttgQQQgQQgQQQQgQtCQQgQQQ 3280 

ctcctttctttctggttttacttaggttactctctcatagatactagttctgttttoactttcttcttttctagttcttt 

• • • • | • • | t • 

EcoRII 3243 3253 

SexAI BsqBI Nspl 

II II 
accogg 1 1 1 gacccac 1 1 at ggaggcacaagg t q 1 1 gacQagaQ tat acc t go tc ta t a t cgacat gcoggaaQ t acg 1 1 3360 
tggtccQQQCtgggtgaQtacctccgtgttccataactgttct tQtQtggQCtagatGtagctgtQcgtcctt tatgcQQ 
|] ' * * ' I * I 

3281 3332 3343 

3282 PIqI 

Bcql 
II 

QgQgQQcgQcgtQCQQgttQQatcQgQgQtQttQQgttcQtQCQQcgagtcQaQtgcQQCQQgQCtQQtQctcQgQQQQg 3440 

tctcttgttgcatgttcaat ttagtctctatQQttcQQgtQtgt tgctcagt ttacgt tgt tctgattatgggtctt ttc 

• • • « 1 1 • • • • 

3406 

Spel 3407 
I 

tgQgCQgtQCQQtgagQtCQCtQgttgCQgtCatCQQCQQCagCQQtCtCCCQCQQQgCQgQQQQCQQtCQtQtQtQQQC 3520 

QCtcgtcatgt tQctctQgtgatcaQcgtcagtagttgttgtcgt tQgQgggtgtttcgtcttt tgttQgtatQtatttg 
■ !«••••• 

3460 Hpal 

Nspl Hindll 
BsqOI HboII An III MboII Mfel n 

I I I I I I P stop 

qaactcaaacQttqcaaaQgtgQtqaaqaagtatctqQQttqatqqacatqttcQatqaaqQtqttQQcaQttqcfta^aq 3600 
cttqQgtttqtaQcgttttcQctQcttcttcQtaqacttaQctQcctgtQcaQgt tact tctQCQQttqttQQcgat ttc. 
I I I * I'll* 

3530 3545 3567 3578 3589 

3567 3584 
3584 

Qt CaOQtQQQaQQQQCQQCQCCgQQ t QQQt QgQCQQCQQQCQQCQg t QgQt CQQQQCC tQtCQO.CQ.CO.CO.CQO.QQtCO.Qg 3680 

tQgtttottttttttgttgtggcttQtttQtctgttctttgttgtcQtctQgttttggatQgttgtgtggtgtttQgttc 

Vspl 
Asal 

P end H start | 

caqQqtqaQQcaotQqQCQtcaatcQQtata c|aQataaqaQaaa| t t t jqqqat taaoq' o.Qto.tiQ.t toatcct tgtccoQaa 3760 
gtctcQCtttgttatctgtQgttQgttatatgtttQttctttttQQQtcctaQtttcttQtttaQttaggaacQggtttt 



3743 
3743 



Fig. l S 

SUBSTITUTE SHEET (RULE 26) 



WO 98/50405 



8/34 



PCT/US98/09270 



PlQl 

tgQgtQtQQctQQCtctgcQQtQtQCQCQttcccggQgtcQtcQttctctgagQQtggtcQtQtQgQQCCQttQccQCtc 3840 

Qctcatat tgattgagacgt tatQtgtgtQQgggcctcQgtQgtQQgQgQctcttQccagtQtQtct tggtQQtggtgog 

• * * | • • * ♦ • 

3796 

PflMI BanHI 
I I 

QQQgtcQQtgaQCDgagoQQQgcagtQcctcQCQt tagagt tgccQQQQtcggQQQtccQCCQQQQCQtggatcccggtQ 3920 
tttcQgt tact tgtctct tttcgtcQtggagtgtQQtctcQQcggtt ttagcctt taggtggttttgtQcctQgggccQt 

• • • t » • | | • 

3901 3910 

HboII 

Fokl HboII Tsp45I 

III I 
tttggQtgtcttcttQctcggcttcttcgQgQtggQQcgQQtcQQQgQCQQQtacgggagtgtgQQtgQtcttgQCQgtg 4000 
QQQCCtQCQgQQgQQtgQgccgQQgQQgctctQccttgcttQgtttctgtttQtgccctcQCQCttactQgQQctgtcQC 
| |. • | • • ■ I ' 

3924 3944 3998 

3929 

Smal 

Aval Barl EcoRII 

I I I 

QcccgggttQCQQQgtttgtggctctggatcQttaccQatcggQttQgccQQatQCQctgggQQtgaccaggQQttQttQ 4080 
tgggcccQQtgtttcaQQCQCcgQgQCCtagtQQtggttQgcctQQtcggtttQtgtgQCCtttQctggtccttQQtQQt 

i ' * • I ' i * 

4002 4057 4068 

4002 

BsrI HsII HboII 
I I I 

CQQCtQQQCtggQCQtQgQQgtgagQQgQQCQgtCQQQgcgQQQgQQQtgQttgtttQtQCggtQCQQQQtQtQaQQCCQ 4160 

gttgat t tgacctgtQtcttcactcttcttgtcagtttcgct t tctttQctaQCQQQtatgccQtgtt ttQtattttggt 
| • I * | ■ • • t • • 

4088 4094 4105 
Avail 

Styl 

Ncol AccI 

Osal BsrI SfoNI Hfal Oral 

I I I I I II 

gQQctgtacccQtggtccagtagactQagaQQQggaatgttgttcgatgccaacaQagttgctcttgctcctcaatgtct 4240 
cttgQcatgggtQccQggtcQtctgQttctttttccttacQQCQagctQcggttgttcaQcgagaacgaggagttQcagQ 
I * I I I ' * I • ' i ' I * 

4170 4177 4206 4228 4238 

4170 4180 
4170 

4174 
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Hboll Hfal Oral 

I I I 

tccQctQgataggQgcQtQQQQttcQgQgtQQtcttcgttQQttgtQcggcQQttggQtcQQtQQCcttgtttQQQQttc 4320 
aggtgatct atcctcgtat 1 1 taagtctcot tagaagcaat toacatgccgt taacctagttat t ggaacaaat 1 1 taag 

• I *i *l 

4273 4291 4311 

BspHI 

BstXI SfaNI PstI 
II II 

ccQQgtcQQtggcQtcQctQtctctQcccQgcacQQtQtcQQtCQQtctgcaggtQCQCQtcQQQQcaggggttcQgQct 4400 
ggt tcagt tQccgtQgtgatagQgatgggtcgtgt tatagttQgt tagacgtccatgtgtagt 1 1 tgtccccQagtctga 

I * I ' i 1 
4321 4332 4368 

4370 

Fokl HphI Avail Bell 

II II 
gQttctaaagggQtagttcaQQttttggatgagaagggtgQQQQQtcactgaatttcatggtccQtctcggattgQtcQa 4480 

CtQQgQtttCCCtQtCQQgtttQQQQCCtQCtCttCCCQCtttttQgtgQCttQQQgtQCCQggtQgQgCCtQQCtQgtt 

I . I . . I I 

4427 4437 4460 4474 

Seal 
I 

aagaaQagtQggcagaatgtQctctgtcgagtQctgtQQacQgaQQatcgagaQQQtgagQttgatattttctttgggat 4560 
t tct 1 1 teat ccgtcttQcatgogacQgctcatgacat 1 1 gtct t ttagctct 1 1 tQctctaact QtaQQQgQQQCCCta 

4510 

HslI BsqBI p vuII 

HboII BsrI NspBII 

I I I I I 

tagt tggQggaatcQgtcttcQtgtcQQtgcQQCtgatctQtQtcQQQQQCQCtagcQQCtcQgctggtQttcQQQQgg 4640 
atcQQCctccttQgtcQgaagtacagttQcgttgacctQgatatQgtttt tgtgatcgttcagtcgaccataagttttcc 

I 'I 'II' * * I 

4577 4593 4623 

4581 4597 4623 

BsaBI 
I 

gagatttgttatcccttQatggQtctQaatccacatctcQatctQgttatctgggcttcQtcQgtagQgattacQQgagt 4720 
ctctaQQcaatagggQQttacctagatttQggtgtQgagttQgatcaatagacccgQQgtagtcatctctQatgttctca 

• • | • • • • • • 

4662 

SfaNI 

Fokl EcoRII 

II I 

ggQ t gcoa 1 1 1 1 ccqqcc 1 1 c 1 1 1 qcc t ggegag 1 1 CQgQ t ac t a t cc t qqcq 1 1 at t gcaQQQggag 1 1 gggaaaat cq 4800 
cctQcgttQQQQggttggQQgQQQtgQQccgctcQQgtctQtgQtaggQttgtQQtQQcgttttcctCQQCCCttttQgt 
||* » | • . * • • • 

4721 4745 
4722 m 

Fig. h 

SUBSTITUTE SHEET (RULE 26) 



WO 98/50405 



10/34 



Spel 



PCT/US98/09270 



H end 



QQCQQtggQQctQgtQQtctctQttttgQtctggatQtatctQttQQgccaQQgcQQQtQQgQgatjaQtcQQQQ(Ktt|Qg 4880 

ttgt taccttgatcat tQgQgQtQaaactQqQcctQtQtQgQtQQt tcggtttcgt ttQt tctctc ttagtttt t gao tc 
| •••••• • 

4810 

Bbvl HboII 
F start F I I 

gacaQQQdiagtcaataccaacQQctattQgcagccacQCtcgctggQQcaagaQQgQQgggatQQaaQQagtttaQcag 4960 
ctqttttdt tcQgt tat ggt tgt tgQtQQtcgt cggt gt gogcgQCCt t gttct t tct tccctQt 1 1 1 tt tcaaat tgtc 

* • • | • • • • | 

4911 4960 



aQgQQQCQQQQQCaQQQQgCQgQgQQCQCCQgQQCQQCaQgQtCQQQQCQCCCQQCCCQCtCQQQQCgQQQQtCtCQQQQ 5040 

ttctttgt tt t tgt 1 1 1 tcgtgtct tgtggtct tgt tgt tctagt 1 1 tgtgggt tgggtgagt 1 1 tgct tt tagagt 1 1 1 

• • »»••#• 

HsII 

BstXI SfoNI 
II I 

gQgattggcQQCQCQQCQQQCQctgQQCQtc[Q^ccaQCCtcQQtQctgctQattQttQCQQCCQtgattQtggcQtctt 5120 

ctctQQCcgttgtgttgtttgtgQcttgtQgtQcggttggagttQtgQcgQttQQtQQtgttggtQCtQQtQccgtQgQQ 

• • • « • • j j • j • 

5103 5114 
5104 

HsII Hpal HboII 

EcoRV Nspl Hindll Fokl EcoRV 

I II I III 

tctgccQQatQgatatcacaaQQCtQcagcQtgtQggtgtattggttaacagtcccQQQgggatgaagatatcacQQQQC 5200 
Qgacggtt tatctQtagtgttttgatgtcqtQcatccQCQtQaccQQttgtcagggtttccctacttctQtQgtgttttg 

• I * II I * 'III* 

5132 5149 5165 5181 5188 

5150 5165 5185 

Tsp45I 

EcoRV HboII HphI 

I I II 

tttgaQacQagatatcQtattttgQgcctcatQccaQQQQtagaQgattctaQctcttgtggtgaccaacagQtcaagca 5280 
aaQctttgttctQtagQttaaaactcggagtQtggtttttatcttctaQgQttgQgaacaccQctggttgtctagttcgt 

•I * * I *ll 

5211 5243 5261 

5262 

Bell Fokl 

I I 

QtQCQQgQggttQttggatQgactgatcattcctttQtQtgatggQt tQQgQttQcagaQggatgtgQtQgtgtccQQtc 5360 

tQtgttCtCCQQtQQCCtQtCtgQCtQgtQQggQQQtQtQCtQCCtQQttCtQQtgtCttCCtQCQCtQtCQCQggttQg 



5304 5341 

Fig. Ij 
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QQgQQtccQQtgQQQQCQCtgQCCccQgQQCQQQQcgQttctttggQggggtQQttggQQCtQttgctctgggQgtggcQ 5440 
ttcttaggt tQcttttgtgQctggggtcttgttttgctaagQQacctccccattQQcct tgataacgagQccctcQCcgt 



Acil ... CacBI 

NspBII Ml 

II II 
acctcagcQcaaattacQgcggcagttgctctggttgaagccQagcaggcaQgatcagacQttgQQaaactcaaggaagc 5520 
tggagtcgtgtttQQtgtcgccgtcQQcgQgaccQQgttcggttcgtccgttctQgtctgtQQctttttgQgttccttcg 

I I' * II" 

5457 5480 
5459 5485 

Sdul 

Sac I EcoRII 
Bsql HgiAI Bcgl Avail 

I I III 

QatcagggacacaaacaaagcagtgcagtcagtccagagctccatQggaaatttgatagtagcaattaaatcggtccagg 5600 
ttagtccctgtgtttgtttcgtcacgtcagtcaggtctcgaggtatcctttaaactatcatcgttaatttagccaggtcc 
• I * I * * 'I * I I ' 

5543 5557 5582 5593 

5557 5596 
5557 

Hindll Hfel Bbvl Eco57I 

I I II 

QttatgtcaacQaagQaatcgtgccatcaattgcgagattaggttgtgaagcagcaggacttcagttaggaattgcatta 5680 
taQtacQgttgttcctttagcacggtagttQacgctctaatccaacacttcgtcgtcctgaagtcaatccttaacgtQat 
I i ' ' 'l I 

5606 5628 5651 5660 

HphI 
I 

acacagcattactcagaattaQcaaacatattcggtgataacataggatcattacaagaaaaagggataaaattacaagg 5760 
tgtgtcgtaatgagtcttaattgtttgtQtaQgccQctattgtatcctagtaatgttctttttccctattttQatgttcc 

• • • | • • • • • 

5714 

SfaNI Acil 
I I 

tQtQgCQtCQttQtQCCgCQCQQQtQtCQCQgQgatQttCQCQQCQtCQQCagttgQtQQQtQtgQtQtttQtgQtCtQt 5840 

QtQtcgtQgtQQtQtggcgtgtttQtagtgtctctQtQQgtgttgtQgttgtcQQCtQtttQtQctQtQQQtQctQgQtQ 

j • I t • • • « « • 

5765 5776 

HphI Hindll HphI Plal 

I I I I 

tatttacagaatCQataaaggtgagagttatagatgttgQcttgaatgattactcaatcaccctccaQgtcagactccct 5920 
ataaatgtcttagttQtttccQCtctcaatQtctacaactgQacttactaatgagttagtgggaggttcagtctgaggga 
I i • i • " I 

5860 5876 5898 5913 
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Fokl 
I 

ttattaactQgQCtgctgaQCQCCCQgQtttQCQgQgtQgQttccQtQtcctQtQQcatccaQQQCogQgaQtggtQtQt 6000 
aataat tgQtctgacgQCttgtgggtctQQQtgtctcatctQaggtQtQgtQtQttgtaggttttgtctcttQccQtQtQ 

• • • • ■ | • • • 

5977 

HboII 
Ksp632I 

Earl BspHI BsmI 

II I I 

ccctcttcccagccacQtcQtgQCQQQQggggcQtttctQggtggQgcQgQtgtcQQQgQQtgtQtQgQQgcQttcQgcQ 6080 
gggagaagggtcggtgtQgtQctgt tttccccgtQaagatccacctcgtctQCQgtt tcttQCQtatcttcgtQagtcgt 
II • I * * • * * 'I 

6003 6018 6071 

6003 
6004 

EcoRII PflHI 
I I 

gttQtQtatgcccttctgQtccQggatttgtQctQQQccatgQQQtggQgQgctgt t tQtcQggQQQCQtQtcccQatgt 6160 

CQQtatQtQcgggQQgQCtaggtcctQQQCQtgQtttggtQCtttQCCtctcgQCQQQtQgtcctttgtQtQgggttQCQ 

• • | • j ■ • • • * 

6101 6118 

Osal EBseRI 

I I 
ccaQgQQCcgtggttQQQtcQgacQttgttccQQgQtQtgcatttgtcoQtggQggQgtggttgcQQQttgtQtQQCQQC 6240 
ggttcttggcQCCQatttagtctgtQQCQQggttctQtQcgtQQQCQgttacctcctcQCCQQcgtttQQCQtQttgttg 

|* • • • • | • • • 

6168 6213 
Bspl407I 
MslI 

Nspl Nspl Bell 
Afllll Bcgl EcoNI 
II I I I II 
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QQQtQctgtcttQQttggQgQtQtgaatcQQCQgctctQtttggQgQQQCttgcaQCCQQQtQtttggattcQQtQaQtt 10720 
tttQtgQCQgQQttQQcctctQtQcttQgttgtcgQgQtQQQcctctttgaacgt tggt t tataaQCCtaatt tatttaa 

1 * * I 1 " 

10660 10697 

10700 

HphI 

gtttQQttggttQCQCcctcgtcttgaaggQQgtQCQQtctQtgtQggtgQtccctQttgtcctccQtCQgQtQQggaQC 10800 

CQQQttaaccQQtgtgggagcQgQQCt tccttCQtgt tagatQcatccQctQgggatQQcaggQggtQgtctQttccttg 

* * • • | • • • • 

HphI 10767 
I 

QtQtQtcattagaggQtcQccctgattctggattttQtgttcataQcccQQgagggggtQtagQQggQttttQtcQQQaa 10880 

iQtQtQgtQQtctcctQgtgggQctQagQCCtQQttQtQcaagtQttgggttctcccccQtQtcttcctQQQQCQgttttt 

• | • • • • • • • 

10817 

PvuII 
NspBII 

Bbvl BsrOI 
II - I 

ttgtggQCQCtcQtQtctQtQQgtgcQQtQCQtctagcQgctgttQgQQtQggcgtQQgggtQQctgcQQtggttcQQgg 10960 
QQCQCctgtgagtQtQgQtQttcQcgttQtgtQgQtcgtcgQCQQtcttQtccgcQttcccQttgacgttQccQQgttcc 

• • t | | • • • | * • 

10917 10947 
n - 10918 

rig. lu io9i8 
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QgQtQQtcQQgctQtQgctgtQQCQQCQQgagtQcccQQCQQttQtgactQCQgQgttQQgQQggQgatQgtttQtQQQg 11040 

tctQttagt tcgatQtcgQCQttgt tgttctcQtgggttgttQQtQctgatgtctcaQttct tcctctatcQQatat ttc 

• • t • • • • • 

HphI Fokl BspHI 

I II 

QtgtggtgagQttttttgQttcQttQagQgQQgtQQtggQtgQtctQggtCQtgQQCttQQQttQQQtgQQQCQQttQtQ 11120 

tQCaCCQCtCtQQQQQQCtQQgtQQttCtCt tCQttQCCtQCtQgatCCQgtQCttgQQtttQQtttQCtttgttQQtQt 
I • • • I • I • • • 

11045 U078 11090 

MboII 

EcoRI Xbal 

I I I 

QgtagcQQgQtgttcQtQtQtQgcQQQQgQQtQtQttQcgQtgggQgQQttcttccccQQgctctgaQQgcQttQtctQg 1 1200 

tcatcgttctQCGQgtQtatatcgt t t tct tQtQtaatgctQccctcttQQgaQggggt tcgagQCt ttcgtaatogatc 

• • * • | • j • * | • 

11167 111% 
11171 

SfaNI 

MboII BsraAI BsqBI MboII Hfel 

I I III I 

Qtgtgtct tctggtcagagQcagtaat agac gaaacaaga t cagco t c 1 1 caaac 1 1 g gcaaca t cq 1 1 1 gcaaaagcaa 1 1 280 
tQCQCQgQQgQCCQgtctctgtcQttQtctgctttgttctagtcgtQgaagtttgQQCcgttgtQgtaQQcgttttcgt t 
I . I . . Ml- • • i • 

11206 11217 11239 11247 11278 

11244 

SphI 

EcoNl Nspl Fokl 

HphI Cac8I Styl BsqBI 

II I I II 

t tgQgaatggttQt tcacctgttctQggatQtgcQtgctcQQtttttQQgaQcattcQQCQQCtQtQtattgcccttggg 11360 
QactcttaccaatQQgtggacaagatcctatacgtacgagttaaaaattcttgtaagttgttgQtatatQacgggaaccc 
I 1 • ' I ' * 'III 

11295 11313 11354 11360 

11298 11313 11359 

1,313 SfaNI 

Fokl SfaNI 
II I 

Qt gaatQtcaQtccaactQtaQCQcagaataQcaaagat t tatat 1 1 taggaatccaaat tggatgcaatatgcatct 1 1 1 1 440 
tact tQtagttaggt tgatat tgtgtct tatagt t tctaaatataQQatcct taggt ttaacctacgt tatacgtagaaQ 

• • • ♦ • * I I * I * 

11422 11433 
11423 

Ball 

BspHI Ms 11 Sspl 

i ii r 

ao.tacctgcto.gtg t tgggggat tcaat tacatggccatgtcaagatg 1 1 1 tgtaaggaatat tggcgatccatcagt t g 1 1 520 
ttatggacgatcacaaccccctaagttaatgtaccggtQcagttctacaaaacattccttataQccgctaggtagtcaac 
I * "I I ' I* 

Fig. Iv 
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11472 
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Acil SexAI 
I I 
ccgcat tagctgQtQt tQaQQgQt ttQttQQggcgaQCCtQttQgaccgaagtgttctt tatQggQt tQtgaQtcQQgQQ 11600 
ggcgtQQtcgactQtQQttttctQQQtQQttccgcttggQtQQtctggcttcQCQQgQQQtQtcctQQtQcttQgttctt 

11521 11600 
Plel 
HphI 

EcoRII BsrI Eco57I Ms 1 1 

I I I II I 

ccaggtgQgtcQtcttttttggQctgggcttcQgQcccQtQttcQtgcQQtttoccQCQQtctcaQQQtQtQQccQCtat 11680 
qgtccQC tcQgtagQQQaQQect gacccgaagtct gggtagaagtacgt tQQQtggtgt togagt 1 1 tatat t ggtgata 

III* • I r i * 

11601 11623 11629 11638 

11604 

11607 HphI 

I 

gQtQQQQQQtQtQQCQgCQQgQQQtgtQttQCQQgQttCQCCgQQtCCQttQttQtCtggQttQttCQCQaQtQCQQtgQ 11760 

ctQttttttQtQttgtcgttctttQCQtQQtgttctQQgtggcttQggtQQtQQtagQcctQQtQQgtgtttQtgttQCt 

* * • I • • • • • 

11718 

HboII 

Mbol! HboII Xbal 

III I 

tQgaagQQgQt gaagQQt t agct gQgt tct t gQtggacQggQQggtQQt tct ccctQgagt tgcQCQtgatot tctogat 11840 
QtcttcttctQcttcttQQtcgQctcaagQoctQcctgtccttccattQagQQggQtctcaacgtgtQctQtQagQtctQ 
I I * I • " * * * I 

11763 11772 11834 
11766 

BseRI 
I 

QQttctctcQCQggaQtcagQQQtgctQtagctggQQtgttQgQtacgQCQQQQtctctQQttcgggttggcatQQQtQg 11920 
ttQQgQgQgtgtccttagtctttQcgQtQtcgaccttQCQatctQtgctgttttagQgattQQgcccQQCcgtatttQtc 

11920 

aggaggactgacQtacagtttgttgaggaQQatcagtaQttacgQtctagtacQatQtgQQacQCtaagtaggactttgc 12000 
tcctcctgactgtQtgtCQQQCQQCtccttttQgtcQttQQtgctagQtcatgttQtoctttgtgQttcQtcctgQQQcg 

Hgal 

HboII AccI BsrOI 

I I II 

gQCtQQttgtQagcgacQQaatcaggtQtgQogatatgtgttcggtQgaccttgctQtagcattgcgtcaaQQgQitgtgg 12080 
ctgattQQCQttcgctQttttQgtccQtQcttctQtQCQCQQgccQtctggQQcgQtatcqtaQcgcagttttctQCQCc 

I I '1 I ' 

12030 12045 12061 

12065 



Fig. In/ 

SUBSTITUTE SHEET (RULE 26) 



WO 98/50405 



PCTAJS98/09270 



24/34 



Fokl HphI 

I - I 

QttcQtttQtcaggQggQQggatgQtQQgtggQcttgaQQCQCCtgatccQttQgQQttQctQtctggggtgQtQQtQQC 12160 
tQQgtQQQtQgtcctccttcctQctQttcQCCtgQQCtttgtggQCtQggtQQtcttQQtgQtagQccccQCtQttQttg 

12100 12149 
Eco57I 

HboII BstXI Fokl Sspl 

II II I 

QggQtcggQQCQttgtQaQQtQtgttattcttcagQtggcQCQQQCccQtQtacttggQtgtQtttQccgggtQatQtta 12240 
tcctagccttgtQQCQttttatQCQQtQagaQgtctQccgtgtt tgggtatatgaQcctQcatQQQtggcccQttQtQQt 

ll • i ' I * 1 I 

12189 12207 12217 12234 

12190 

Tsp45I 
AMI Bqlll 
I I I 

QQQtQggotcagcQgaQQCQggtQtQtcQtcQt tgagagt tcct tat 1 1 tggQtcagtcQctgQtgagQgQtctgaggca 1 2320 
t ttatcctagtcgtct ttgtccQtQtQgtagtQQctctcQQggQatQQQacctQgtcQgtgQCtactctctQgQctccgt 
• • • • • | | • | • • 

12295 12309 
12297 

Hfel BspHI Acil BsrOI 

I III 

CQQttgggQtQtQtcQagQQtcttagtQQacctgcQQaQgccgcQQtQQgQQtagcQQtgQtQtQtQCQtgggcQtttgg 12400 
gttQQccctotQtQgttcttagQQtcQtttggQcgttttcggcgttQttcttQtcgttQctQtQtatgtacccgtQQQcc 

l • 1 'I ' I * 

12321 12350 12361 12375 

Prill 
BsqAI 

EcoRV Fokl M AfUII BsmAI 

III II I 

tQQtgQtgagQtQtcttggQtggQggcctcQCQQatQgcQcaQQCQcgtgcQQattttQCQCtQgatagtctcQQaattc 12480 
attactQctctatagQQcctQcctccggQgtgtttQtcgtgtttgtgcacgtttaQQQtgtgatctQtcQgagttttaag 

I I* I ' ll ' ' I* 

12410 12418 12424 12444 12469 

12445 

12445 _ f 

BsrI 

Age I Gsul 
I I I 

tQQcaccggtQgctQcatcQQCQQQtttQtcQCQCQgQttQQQggQtagtgcQQCccQgatgQQgttctccQgtQcatcQ 12560 
attgtggccatcgQtgtQgttgtttQQQtQgtgtgtctQQtttcctQtgQcgttgggtctQcttcaagQggtCQtgtQgt 

1 i I 

12485 12548 

12550 



Fig. Ix 
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Eso3ll 

Nspl BsnAI 

MslI AflHI BsqI BsqBI 

I I II 

ttgattQgggtcagcQgQttcQtQQCQQtgtccQQtgQtQQCQtgtctQtcQaggQQgctQQtgagQccQQQgatQccQa 12640 
QQCtQQtcccQgtcgtctQQgtQttgttQCQggttQctQttgtacQgQtQgttccttcgattQCtctggtttctQtggtt 
., . .| . . | • | 

12581 12601 12624 12633 

12601 12624 

12624 

Hpal Sspl 
Hindll MM HboII 

I II I 

tcttQtttQtcQQCQQQtQQtgttQQCQggQttQQgtgttttcgQQtQtttQtttQgattQgaagQQQCCQCQggQCQCQ 12720 
aqaataaataq t tgt t tat tQCQQt tqtcc tacit tcacaaaaqct tataaataaatctaatct tct t tggtgtcc tgt gt 
. , . Ml* 'I 

12662 12681 12702 

12662 12685 
BsqI Vs Pt AccI 

Ms 1 1 SfaNI MboII Asel Plel 

I I I I III 
QCCccQtQgttQtgcQtctgcacatagaagQtgQgtgt tgcat tQQQgQQQgttttQQtgQtgQgcatQttaQtccQgQg 12800 
tggggtQtcQQtacgtQgQcgtgtQtcttctQctcQcaQCQtaatttctttcaQQQttQctQctcgtQtQQttQggtctc 
, . | | . | . . . • |. || 

12725 12734 12747 12789 12798 

12738 12789 12800 

Vspl Kpnl BsrBI 
Asel BanI Acil 
I I I 
tctQCQttQgQQttQQttQggtQccctgQQQgtQQtgQQtttQtttQtgQtQQQgacccgctcQQggQcgtggQcttQtc 12880 
QgatgtQQtcttQQttaQtccQtgggQctttcQttQcttQQQtQQQtQCtQtttctgggcgQgttcctgcQcctgQQtQg 

• I I ' * i 

12812 12820 12858 

12812 12820 12858 

Hfel Fokl Nspl 

I I I 

QQQQGttQtggttQttQQQgQtcattcttQCQCQQttgatatgQQttQttgggQtgatQCtgQCQtcatacQtgcQQttt 12960 

ttttgQQtQccQQtQQtttctQgtQQgQQtgtgttQactQtQcttQQtQaccctactQtgQctgtagtQtgtQcgttQQQ 

• # « | • . | | * 

12913 12932 12950 

M Tsp45I BsrOI 
I I I 
cQQtatgtQCtgcQQttQCQQtQgcagacQCtQtgtcQCQQttQgQtcgQgQtQQCttQQQQgQgQtQQtQgtCQitgcQ 13040 
gttQtQcatgQcgttQQtgttQtcgtctgtgatQCQgtgttaQtctQgctctQttgQQtt ttctctQttatcagtQacgt 

• • | « | « • • " i * 

12987 12995 13034 



Fig. ly 
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Vspl 

Asel Plel AF1II 

I I I 

aatgatgatgatattQQtagcttQatcactgaatttttgQctct tgatatQcttgtatttcttaagacatttggtggatt 13120 
ttQctactQctQtQattatcgaQttagtgacttQQQaactgagQQctQtQtgaQCQtaQQgQQttctgtQQQccacctQQ 

• j • • |* • • j • t 

13053 13079 13101 

l305 3 Avail 
Oral PpuMl 
I II 

attagtQQQtcQQt t tgcatacQCtctt tatQgtttQQQQQccgaaggtQgggQcctcQt t igggattatataatgagaQ 13200 

taatcQt t tagt taaQcgtQtgt gQgQQQtQtCQQQtt 1 1 tggct tccQtccctggagtQQaccc taatatat tQctct t 

• » • | ♦ • 1 1 » • • 

13154 13171 
13172 

Sspl Fokl 

I I 
CQCtgagQgQtQcttcccQttcaatattQQQQgtQttQtctQQtgcQt tatctcatcctQQQgtQttcQQgaggttctgg 13280 
gtgactctctatgaQgggtQagttQtQQttt tcatQQtQgQttQcgtQQtQgQgtQggQtttcQtQQgttctccQQgacc 

• i | • • • | • * t 

13223 13254 

Plel Sspl 

I I 

gttttgtggQgtcttQQQccctQtttQtggccctQQtQctgctQgtcQagaccQgQtQaaacttgccctctctQtQtgtgQ 13360 

ctaacacctcagQQt ttgggatQQQtQccgggattQtgQcgatcQgt tctggtctQttttgQacgggagagQtQtQCQCt 
j < • t • • • ■ | 

13288 13360 
Bgill Tsp45I 

QtattcQCtagotctat ttQtgQgQgaatggt tgQQtggtgtatcact tgaaatatacat ttgtgacagcgQtQtggQQg 13340 
tQtQQgtgQtctQgQtQQQtQctctcttQccQQCttQccQCQtQgtgQQctttQtatgtaaacQctgtcgctQtQccttc 

I • • t • • I ft ft 

13370 13423 

Xbal SfaNl Avail 

I II 
ttgcgaatgataggaaQcaagcctttatttctagacacctttcatttgtttgttgtttagcagQaattgcQtcttttgga 13520 
aacgcttQctatcctttgttcggQaataQagatctgtggaQagtaaacaQacQacaaatcgtctttaacgtagaaaacct 

• • | • • • j« | • 

13470 13509 13518 

BsnAI Sspl 
I I 

CCtQQCCtgttQQaCttQQCQtQCttQgQgQgQCttgQtCtQttgQQQCQQtQtCttgQQttQQQtQttQQQgQCgQCCC 13600 

ggQttggQCQQtttgQQttgtQtgQQtctctctgQgctagQtQQCtttgttQtQgQQcttQQtttQtQQtttctgctggg 

• • | • • • | « « 

13550 13584 



Fig. Iz 
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Pad 

Vspl Mi 
Bspl407I Asel BsaAI 

I II I 

cattcttQQQtQtgtQcaQQtQtctggQttQttQQttQQQtcgttcccQtcQQCtgtQQcatQcgtQQgaQQgactgcQQ 13680 
QtgagQatttQtQcatgtt tQtogacctQQtQQttQat ttagcQQgggtQgttgacattgtatgcottct ttctgacgtt 

13613 13631 13662 

13631 13662 

13632 

BantHI 

Sen I Bsu36I BsqBI MboII 

| P I III 

tcaaatatt tQaggattcgtggtattQgtccQcctgaggtQQttgQtgat tgggatccgQtQgQagatgQQQQtatgctg 13760 

aqtttQtQQQttcctQQgcQCCQtQQtcQggtggQctccattQQctQctQQccctQggctQtcttctQcttttQtQcgac 

I • • i r I • i 

13584 13713 13728 13743 HaeII 

13733 Eco47III 

I 

gQtQQCQttgtCQOQQCtQtQQQtgQtQQttgtQQtQQQgQtQQtQQQgggQQtQQaQttQQCQQtttCtggggQCtQgC 13840 

ctat t gtaacagt 1 1 tgat a 1 1 tactat taacat tat t tctat tat t tccc t tat 1 1 tao.t tgt taaagacccctgo. teg 

13838 

Avail 13838 
Aflll PpuHI Ball I SfaNI Agel 

I II I II 

gcttQQgQQctatcQggtccttQQQQtcQgatctQtQQCQQgtgQttctgQtQQtQQtgQtagatcagQtgctQgtQccg 13920 
cgaQt tcttqQtQgtccQggaattttQgtctQgQtQttgttcQCtQQgactattQttQctQtctQgtctQcgatcQtggc 
| • || • I* • • " 1*1* 

13842 13855 13869 13908 13917 

13856 Hfel Spel 

gtggtttgacQCttcctcQQggggggQQttQtctQtcQcatcaQttgQgQttQttcggQQtcQQcagcQCtQgttgtctg 14000 
cQCCQQQCtgtgQQggQgttccccccttQQtQgatagtgtQgttaQctctQQtQQgccttagttgtcgtgatcQQCQgac 

• • • • | * | * * 

13962 13989 
Styl 
MboII 
Ksp632I 
Earl 

EcoRIl Sap I Avrll 
I 111 I 

QQQgctcttgagttQtcQCQQQttttQQtgaQggaQgttQQtQQQgaccQggacaggctcttcctQggQgQQggQgcQgg 14080 
tttcgagQQCtcaatagtgtttQQQQttacttccttcQattQtttctggtcctgtccgQgaQggatcctcttcctcgtcc 

I * III* i 
14048 14057 14063 
14058 
14058 



Fiq. Iaa 14059 



14063 
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PstI 

Nspl BspHI 

Nhel Avail 

Cac8I SfoNI PpuHI 

II I 1111 

age t Qtgctagcatgt tat gQtgccacQt taggacc tgcagt taQt tat tatQat t ccggt t tgaatataacagat gtaa 14160 

tCgQtQCgQtCgtQCQQtQCtQCggtgtQQtCCjg^QCgtCQQttQQtQQtQttQQggCCOQQCttQtQttgtCtQCatt 

14087 14100 14111 
14087 14112 
14091 14114 

14116 

Sspl 

Hind 1 1 XmnI Eco57I Tsp45I 

I I I I I 

ttggtcQQcgQgQQttgQQQQtQttcccttcQgQggtQtcQttQgtaggtQQQQQQttQggQQQtgtgacQCQgQtcctt 14240 

QaccagttgctcttQQctt ttQtQagggaagtctccQtogtQQtcatccQttttttaQtcctttQCQCtgtgtctQggQQ 

14164 14177 14188 14226 

14180 BsinBI 

Seal Xcnl Bsnl BsmAI 

I I II 

aatQgggtQQQQgtQctgttcQQtggQQQtccQQQttcQQCQtggQtQggQQQtQtggQQtgcgQgQcgttQQtQtggQg 14320 

ttQtCCCQttttCQtgaCQQgttQCCtttQ^gtttQQgttgtQCCtQtCCtttQtQCCttQCgCtCtgCQQttQtQQCtC 

14252 14271 14298 141304 

141304 

XmnI 
MboII 
I I 

tgQQttgaQtgQtaagtctQt tggQttggtQCQttgtgatatggaaggQgctQtcggtQQQtcQgQQgaQQCtgttctac 14400 

QCttQQCttQCtQttCQgQtQQCCtQQCCQtgtQQCaCtQtQCCttCCtCgQtQgCCQtttagtCttCttCgaCQQgQtg 

14385 
14388 

HslI Fokl 

QtgQQCQCtatagtgttQtQQgQQttQCQtQCttgQttggggQtgQtgQtgttgttttQQtttccQQQQttQtQCCtQCQ 14480 
tacttgtgatQtcQCQQtQttcttQQtgtQtgQQctQQcccctQctQctQCQQCQQaattQQQggttttQQtQtggQtgt 

14406 14441 
Xbal 

QtcQCtccgQQttggtctQgQQtQctttQtctQtQtQQgttQtQttggQQQgattgaagtQtQQtatcQCttQQQQCttc 14560 
tQgtgQggcttQQccagQtcttQtgQQQtagQtQtQttcaatataacctttctQCQUcatattQtQgtgaQttttgaag 



14496 



Fig. 166 
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SfoNl SfoNI 

toQtcctqcQtcQQCQgQQttQtQtctQQtttcQQQQgQtgcgtQttgtQctQtQQtggaacctagtgQQgttgttttQt 14640 
attQggacgtQgttgtcttQQtQtQgQttQaagttttctQcgcQtQacQtgQtQttQccttggQtcQCttCQQcaQQatQ 

14568 14598 

Tsp45I HboII HboII 
I I 1 

CQQQQCttQQQagQttgtCQCtCttggQQgQQQatQQtCtQttQaQQtggQtCQttttQtCQQQgaagQQQQQtQQtgQQ 14720 

gttttgaattttctQQCQgtgQgaaccttcttttQttQgQtQQttttQcctQgtQaaatogtttcttctttttQttQctt 

M657 14667 14705 

Eco31I 

BsnAI 

BspHI Bsal Ndel 

| II 
tggttQcatcQtgQQQtcQQQgQQggggQQQgagQttatggQgttatgQgQCCQtatcQtQtggcQttacQQQtttttgg 14800 

accaat gtag tac 1 1 tagt t tct tcccct t tctctaatacctcQatQCtct ggt Qtagt ataccgt aat gt 1 1 qqqqqcc 

1472g 14768 14778 

14768 
14768 

Oral Bglll 
Swal Gsul ^ 

otttcaQQtCQatttQQQtcQtctggcgQOQgaatttttQtcQQCtcragatctgoctMtatcQQCQQtQtQatccoQQ 14880 
taaag 1 1 tagt taoat t togtagaccgct t tct toaoaQtogt tgaggtctagactgat tatogt tgt tatat toggt 1 1 

14912 14845 
14813 Vs j H849 

Fokl Asel BspHI BsnAI Acil 

I III I 

GttttcagQgQQCQQtcaQggatgttttgtttgQotggQttQQtQtQactcQtgatggtQQgQQgcQtQQattQggcggg 14960 
CQaQQgtctcttgttQgttcctQCQQQacaQQcttQcctQQttatattgQgtQctQccattctctgtQtttQQtccgccc 

149Q0 14919 14930 14942 14956 

14919 

Spel 

HboII 
Bbsl ( 

QqatQtQQcatQttcccQctgQQQaQtQaggggQQQttQagactgctQtcgQgQQgQctQgtattQQgttggatttcQtt 15040 
tctQtQttgtatQagggtgQctttttQttcccctttQQttctgQcgQtQgctcttctgatcatQQttcQQCCtQQQgtQQ 

'II' 

15013 

^ 15013 

Fig. Ice i5on 
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Plel 

I 



Sdul 

HgiAI Bsrl 



gtcQttQtcgactcgQttQcttQCQggtcgttttcctgQtgaQQQQtttgQQcatQgagcQCQgQctggQtQtgtgtCQt 15120 
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